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Lensing of the CMB generates a significant bispectrum, which should be detected by the Planck 
satellite at the 5-sigma level and is potentially a non- negligible source of bias for /nl estimators 
of local non-Gaussianity. We extend current understanding of the lensing bispectrum in several 
directions: (1) we perform a non-perturbative calculation of the lensing bispectrum which is ^ 10% 
' more accurate than previous, first-order calculations; (2) we demonstrate how to incorporate the 

I signal variance of the lensing bispectrum into estimates of its amplitude, providing a good analytical 

^ . explanation for previous Monte-Carlo results; and (3) we discover the existence of a significant 

lensing bispectrum in polarization, due to a previously-unnoticed correlation between the lensing 
potential and i5-polarization as large as 30% at low multipoles. We use this improved understanding 
of the lensing bispectra to re-evaluate Fisher-matrix predictions, both for Planck and cosmic variance 
limited data. We confirm that the non-negligible lensing-induced bias for estimation of local non- 
Gaussianity should be robustly treatable, and will only infiate /nl error bars by a few percent 
, over predictions where lensing effects are completely ignored (but note that lensing must still be 

accounted for to obtain unbiased constraints). We also show that the detection significance for the 
lensing bispectrum itself is ultimately limited to 9 sigma by cosmic variance. The tools that we 
\ develop for non-perturbative calculation of the lensing bispectrum are directly relevant to other 

calculations, and we give an explicit construction of a simple non-perturbative quadratic estimator 
for the lensing potential and relate its cross-correlation power spectrum to the bispectrum. Our 
numerical codes are publicly available as part of CAMB and LensPix. 
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I. INTRODUCTION 



, The large-scale CMB temperature anisotropy has a contribution from the blue- and red-shifting of photons as they 

■ fall in and out of potential wells between the last-scattering surface and our observation. This integrated- Sachs- Wolfe 
, (ISW) effect is not present during matter domination, but becomes important at redshift z < 2 at which dark energy 

■ starts to affect the evolution of the matter perturbations. The CMB is also gravitationally lensed by structures along 
04 \ the line of sight, with most of the effect also coming from z < 2, so there is a correlation between the ISW signal and the 

i CMB lenses. The effect of an ovcrdcnsity is to magnify the last-scattering surface, effectively locally shifting the scale 
of the acoustic peaks. The variance over some range of scales is therefore changed by the magnification if the spectrum 
^-H . is not flat. This leads to a correlation between the small-scale CMB power and the large-scale lenses, and hence a 
' correlation between the large-scale CMB temperature and the small-scale power. This corresponds to a 'squeezed' 
^ , bispectrum shape — it is the correlation of one large scale with two much smaller scales. The lensing bispectrum falls 
off rapidly as the largest scale decreases, since the ISW contribution to the temperature falls rapidly on smaller scales. 
. ^ The existence of a significant temperature bispectrum is well known, and must be modeled when trying to detect 

■ small levels of local primordial non-Gaussianity It can also be used as a probe of the perturbation growth and 
" " ' expansion history of the universe at low redshift, and hence help to constrain the dark energy and curvature p|-p^. 

Calculations of the temperature lensing bispectrum have until now been calculated at lowest order in the lensing 
effects, although some simulation work has also been done to verify that the effect of higher-order terms is small 
In this work, we demonstrate how to extend these calculations non-perturbatively to higher-order by working in 
an 'unlensed short-leg' approximation, where we take one large-scale mode of the CMB temperature to be unlensed. 
This produces higher-order corrections to the usual lensing result which may be accurately reproduced simply by 
replacing the unlensed power spectra which appear in the lowest-order calculation of the lensing bispectrum with 
their lensed counterparts. This results in 0(10%) corrections to the lensing bispectrum which we verify using Monte- 
Carlo simulations. 

The lensing bispectrum should be detected soon at high-significance (e.g. ^ Scr in the data of the recently-launched 
Planck satellite |^). In this regime, the cosmic variance of the lensing signal can have large effects on the expected 
error of the bispectrum amplitude. Calculation of the increase in error at first appears daunting as it involves a six- 
point function in the non-Gaussian, lensed CMB, however we will show how a heuristic interpretation of the lensing 
bispectrum estimator as a cross-correlation between the observed CMB temperature and a quadratic reconstruction of 
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the Icnsing effects can be used to intuit an accurate approximation to the signal variance. This method also generalizes 
straightforwardly to a calculation of the increase in variance for other estimators of non-Gaussianity, where the bias due 
to lensing represents an additional effective source of noise. This increase has already been investigated numerically 
by Ref. under the assumption that the amplitude of the lensing bispectrum is well constrained and may simply 
be subtracted from the data. We are able to reproduce this result analytically, as well as extend it to the case where 
the amplitude of the lensing bispectrum is treated as a free parameter and marginalized over directly from the data. 
Our discussion also leads to improvements to standard bispectrum estimators, which incorporate the signal variance 
appropriate to the lensing bispectrum. 

Discussion of the CMB lensing bispectrum in the literature has focused on the temperature anisotropics since there 
is no direct analogue of the ISW effect in polarization. However, as we will show here, the large-scale E polarization 
from reionization is also directly correlated with the z < 3 matter distribution, giving a correlation between the 
-E-polarization and lensing potential at up to the 30% level. This generates a significant polarized lensing bispectrum, 
detectable at ~ 2.5a with cosmic-variance limited data. We present the first calculations of these effects, and generalize 
our analytical non-perturbative bispectrum and variance calculations to the polarization case. Including this effect in 
a fit for the amplitude of the lensing bispectrum would increase the significance with which it is detected from 3.8(t 
to 4.5(7 for Planck, or from 5.3cr to 8.3cr for an experiment which is cosmic-variance limited to Imax = 2000. 

The outline for this paper closely follows the description above. In Section || we review the quantitative description 
of lensing effects as a remapping by the gradient of a lensing potential tp, and derive the cross-correlation between 
the lensing potential and the CMB temperature and polarization. In Section |l| we then present calculations of the 
lensing bispectrum on the fiat-sky, both at first order in the lensing potential as well as in the short-leg approximation 
which is effectively accurate at higher order as well. Use of the flat-sky expressions makes it straightforward to 



gain an intuition for the terms involved. In Section [V we proceed to give full-sky results for both temperature 
and polarization, which generalize straightforwardly from the flat-sky limit. In Section ^ we discuss the variance of 
estimators for the lensing bispectrum, and the increased variance for other non-Gaussian bispectra which occurs when 



marginalizing or subtracting the lensing contribution to avoid biases. Our conclusions are summarized in Section VI, 
and the details of several more involved calculations are contained in appendices. Throughout we assume a standard 
ACDM cosmology, and for numerical examples use a constant spectral index spatially-fiat model with ftbh^ = 0.0226, 
ilji^ = 0.112, h = 0.7, As = 2.1 X 10^*^, 7is = 0.96, t = 0.09, and approximate the three neutrinos as massless. 

II. THE LENSING POTENTIAL AND ITS CROSS-CORRELATION WITH TEMPERATURE 

ANISOTROPY AND POLARIZATION 

The effect of gravitational lensing is to alter the direction of propagation of photons such that when we look in 
direction fi we are actually seeing photons that originate from n + a on the last-scattering surface, where a is a 
defiection angle. Using the Born approximation, the deflection angle of a source at conformal distance x* is given in 
terms of the Weyl potential 4* (i.e. the average of the Newtonian-gauge potentials) by the line-of-sight integral 

a = -2 dx /f^::f\ vnvl/(;^n; % - x), (1) 

Jo JK[X*)JK[X) 

where Vn represents the angular derivative, equivalent to the covariant derivative on the sphere deflned by fi. The 
quantity 770 — x is the conformal time at which the photon was at position xn, and fxix) is the comoving angular- 
diameter distance. It is convenient to define the lensing potential, 

^{h) EE -2 r dx T7%^*(XA; ^0 - X), (2) 
Jo Ik[x*)Jk{x) 

so that the defiection angle is given by VnV^. From now on we write this simply as Vtj}. For full derivations and 
review see Refs. [|ll|, p^ . 

Since the lensing potential is a weighted integral of the Weyl potential along the line of sight, it is correlated to the 
ISW contribution to the CMB temperature given by 

Arisw(n) = 2 r dx*(xn; 7?o - x), (3) 
Jo 

where the dot denotes a conformal time derivative. In concordance ACDM models, ATisw and -0 are highly correlated 
(at above the 90% level) due to the similarity of their redshift kernels, which leads directly to a correlation between 
the total CMB anisotropy and the lensing potential. The full result for the angular power spectrum Cf^ can easily 
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FIG. 1: The geometry of the polarization signal generated by scattering at reionization: the figure is roughly to scale in comoving 
distance, with reionization (shading) starting at z ~ f f , about 2/3 of the way to the last-scattering surface (dark red outer 
shell). After the universe has reionized, the probability of scattering falls off as the universe expands, so most scattering occurs 
between ~ 1/2 and --^ 2/3 of the distance to recombination. An electron at reionization sees its own last scattering surface as 
indicated by the red shell, and Thomson scattering of the quadrupolar component of the distribution of photons originating 
from this surface generates i?-polarization. For an electron at the start of reionization its last scattering shell extends from our 
last-scattering surface down to a redshift of about z ~ 2. Perturbations that generate a large-scale polarization signal will be 
correlated on large-scales, and therefore be correlated to perturbations at z < 2 (see Fig. ^). 




FIG. 2: An illustration of the correlation between the quadrupole component of the polarization from reionization and the 
local density quadrupole, here generated by a superposition of two orthogonal plane waves. 

be calculated numerically, and typical results are shown later in Fig. ||; in total the correlation is nearly 50% at Z = 2, 
but decreases rapidly with scale as the ISW contribution to the total diminishes, giving only a few percent correlation 
by I = 100. 

The story with the polarization is rather different. The temperature quadrupole generated by the ISW can re- 
scatter leading to a correlated polarization signal; however as shown in Ref. this signal is tiny because there is 
little scattering at the low redshifts where the ISW signal becomes significant. The dominant correlation is actually 
between the lensing potential and the large-scale polarization £^-modes generated by scattering at reionization. At 
redshift z ~ 11 where reionization occurs, i?-mode polarization is generated by Thomson scattering of the local 
radiation quadrupole. This quadrupole has contributions from a wide range of redshifts (for the observer) , overlapping 
with the region 1 ^ z ^ 6 from which the CMB lensing potential is sourced, and is correlated over long distances. 
This is illustrated more concretely in Figs. |^ and ||, and plots of the cross-spectra and correlation coefficient are 
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FIG. 3: The power spectrum (left) and dimensionless correlation coefficient (right) for the correlation of the CMB lensing 
potential with the CMB temperature anisotropy (top) and i?-mode polarization (bottom) for a standard ACDM cosmology 
with an optical depth to reionization r = 0.09 (solid lines) and r = (dashed lines). Note that without reionization the 
S-polarization power spectrum on large scales is very small, so although the correlation is still significant the actual size of the 
signal is very tiny. The T-ip correlation is due mainly to the ISW effect; the E-^ correlation to the reionization signal. 

given in Fig. ^ The large-angle E-ip correlation is negative and so produces radial polarization around large-scale 
overdensities. Further discussion of the E-^p correlation is given in Appendix ^ where a simple analytic model which 
reproduces the main features of Fig. ^ is developed for the case of instantaneous reionization. With cosmic-variance 
limited full-sky and E, Cf^ could be detected as non-zero at approximately 2.5 sigma (compared to nearly 8 sigma 
for Cf^ from ip and T). 

Although the ISW effect does not directly generate the correlated E'-polarization signal (reionization occurs well 
before dark energy becomes dynamically important), there is nonetheless a significant indirect correlation between 
the ISW and E because, as we have noted, the lensing potential is highly correlated to the ISW signal. Indeed the 
Cf^ correlation at large scales is suppressed by about 1/5 due to the (anti-)correlation between the ISW signal and 
the E polarization. Note that the latter has the same sign as the Cf^ correlation. Accurate numerical calculations 
of both Cf''' and Cf are now included in CAMB^ 0. 

III. FLAT-SKY CMB TEMPERATURE LENSING BISPECTRUM 

To understand the basic shape of the lensing bispectrum it is useful to start by considering the simple case of 
the CMB temperature in the flat-sky approximation. We follow the flat-sky notation and conventions of Ref. [ pT| . 
Assuming statistical isotropy and that parity invariance holds in the mean, the reduced bispectrum bi-^i^i^ can be 
defined as 

{f{h)f{h)f{h)) = :^S{h +h + h)biMs , (4) 



ittp : // camb. info 
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where T(l) is the Fourier transform of the lensed temperature, and the delta-function ensures the triangle constraint. 
The reduced bispectrum is symmetric in its arguments and it is therefore convenient to restrict the values of li, I2 
and ^3 such that h < h l£ h- with other combinations obtainable by permutation. 



A. Leading perturbative result 



Lensing remaps the temperature anisotropics so that the lensed temperature field T(x) is related to the unlensed 
field T(x) by T(x) = T(x + Vi/'). Fourier transforming and performing a series expansion to first order in ip gives^ 



f (1) = T(l) 



d^li d^L 

"27r~27r 



T(L)^(li)li •L(27r)5(li+L-l). 



(5) 



Assuming Gaussianity of the lensing potential and the CMB temperature anisotropics, to first order in ip we then 
obtain the three-point corr elation |l§ 



(f(ll)f(l2)f(l3)) 



6{h +h + I3) (li • 12)C,7q7 + 5 perms 



27r 



(6) 



This is the standard first-order result for the lensing bispectrum; as we shall see higher-order corrections result in 
corrections at the 10% level. 



B. Unlensed short-leg approximation and non-perturbative result 

As we have described, the physical origin of the bispectrum signal is the small-scale power changing due to 
(de)magnification and shearing by large-scale lenses. If we consider a lensed CMB sky, and add an additional large- 
scale lens, it will look substantially similar, but re-sized. We would therefore expect the power spectrum of the 
small-scale fluctuations over the extent of the large-scale lens to be determined by a shifted version of a lensed power 
spectrum. In the lowest-order result we calculated above, the expression for the bispectrum involved the unlensed 
small-scale temperature spectrum, but since the result is only lowest order we can expect higher-order corrections on 
small scales. 

Since the lensing potential correlations fall ofi^ rapidly at high I, all of the bispectrum signal is at small li {li < 100, 
where we restrict to h < h ^ h)- Since the lensing effect on the temperature at I < 100 is very small, to a good 
approximation we can calculate the bispectrum neglecting the lensing of the short leg ^i, i.e. (T(li)r(l2)r(l3)) ~ 
(T(li)T(l2)r(l3)). As we will show, many of the results in this paper may be verified numerically with Monte-Carlo 
simulations, and in all squeezed-shape cases we have checked, short legs have proved more than adequate. A more 
general approximation may be required to accurately assess the lensing bias on non-squeezed bispectra. 

Using the fact that the lensed temperature is linear in T, and integrating the Gaussian expectation by parts, we 
have 

{T{h)f{h)f{h)) Cl^ (f (l2)f (13))^ , (7) 

where the ensemble average is taken over realizations of both the CMB and the lensing potential. The only approx- 
imation here is that T(li) is uncorrelated with the unlensed temperature modes that contribute to T(l2) and ^(la). 
Using 

^f(l).-^l,.W(l + l,), (8) 



^ We are assuming the unlensed CMB is a single source plane at recombination governed by a single lensing potential. This is not quite 
correct on large scales because the ISW contributions are more local; however the bispectrum is only significant for small-scales of the 
lensed field, so we can neglect this complication to good accuracy. 



6 



9 



0.08 
0.06 
0.04 
0.02 



-0.02 
-0.04 

0.15 
0.1 
0.05 



-0.05 
-0.1 

0.06 
0.04 
0.02 



-0.02 
-0.04 
-0.06 
-0.08 



TT 

\ \ 
\ \ 


1' 

'a' I ' 
' ' 'a' 

'A' 

' - 

— f — I — J — 


\ \ 

^ V 

\\ 


' ' ' 
f ' ' 

'/ V '/ 
'/ \ 


EE 


H 


V / \ / 




TI— 

A 









200 400 600 800 1000 1200 1400 1600 1800 2000 



FIG. 4: The fractional difference between the flat-sky lensed temperature power spectra and the unlensed power spectra for 
Cf^ /Cf^ — 1 (top; dashed), Cf^'^ /Cf^ — 1 (top; solid) and analogously for the polarization in the lower two plots (see 
Appendix The lensing bispectrum depends on Cf^"^, with Cf^"^ ~ Cf^ to within about one percent. 



where we introduced the lensed temperature gradient, Vr(x) = (Vr)[x + ViA] and its Fourier transform, we then 
have 

(9) 
(10) 



^-^[T{h)T{h))^ = --\,.(vn\,+h)nh)) + [h^h) 

= +h + h) [(li • l2)Q7^ + (li • h)Cl 



Here we have defined the power spectrum Cf^^ by 

(VT(l)f (1')) = ilCf'^^Sil + 1'), 



(11) 
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so that 



ir 



1 • vr(i)f'(i') 



TVT 



S{l + l') 



(12) 



The expression for (T(li)T'(l2)r(l3)) then follows simply from Eq. (^. In the absence of lensing, Cf^^ reduces to 
the usual temperature power spectrum. With lensing, to the extent that gradients and lensing commute, C^^^ is 
reasonably well approximated by the lensed power spectrum. Indeed, in Fig. ^ we show numerically that approximating 
Cf^^ ~ Cf^ is correct to about the percent level. For the temperature bispectrum this then gives 



'1 



(ii 



h)cr 



(ii 



h)cr 



(13) 



In Appendix |^ we show explicitly that this nou-perturbative relation agrees with a direct perturbative calculation 
to third order in ip. Figure ^ shows the effect of the higher-order corrections, effectively smoothing out the lensing 
bispectrum at the 10% level; this may be important to estimate correctly the contribution of CMB lensing to estimators 
for other forms of non-Gaussianity, and also for using the lensing bispectrum to obtain cosmological constraints. 

Note that Eq. (10) for the response of the lensed CMB covariance to a mode of the lensing potential differs from 
that which is usually derived at lowest order in the lensing potential, e.g. for quadratic estimators in which the 
unlensed spectra appear rather than (effectively) the lensed spectra. The neglect of these higher-order contributions 
leads to a bias in standard quadratic lensing estimators, which is more rigorously calculated in Rcf. [^8|. The non- 
perturbative response of the lensed covariance to a mode of the lensing potential which we present here provides a 
faster, more intuitive way to arrive at the same result. 

Finally wc can easily construct an accurate approximation for the lensing bispectrum which is non-perturbatively 
correct if the short-leg approximation holds, and also agrees with the perturbative result to leading order even if it is 
violated: 



(li-l2)C,^^Cr'' + 5perms. 



(14) 



C. Squeezed limit 

Since Cf^ rapidly becomes small on small scales, the bispectrum is nearly zero unless h is small. However the 
lensing deflection angles are small, a few arcminutes, so the lensing only has a significant effect on T(l2) on small 
scales {I2 ^ 1). Hence almost all of the bispectrum signal is in squeezed triangles with ^ ^2 ~ '3- If we consider 
the ultra-squeezed limit we can define 1 = (I2 — l3)/2 = I2 + li/2 = — 13 — li/2 and expand in the small quantity li/l 
giving the leading terms for the reduced bispectrum. 



(ll • l2)Q7 (ll • Is)^;^ 



(ii ■ 1)^ dcT 
din; 



COS 20;^ ; 



dcr 

dln^ ^ P dij-^; 



1 d(pcr) 



(15) 



The partly quadrupolar dependence on the angle between the large-scale and small-scale modes is very dif- 
ferent from the isotropic squeezed limit expected from primordial modulations (e.g. the local /nl model), mak- 
ing the quadrupole part of the lensing signal orthogonal. For li and 1 parallel, the signal is proportional to 
C';"^"^dln(;C;"^"^)/dlnZ, reflecting the change in small-scale power due to shifting of scales by lensing (de)magnification 
and shearing. Since the spectrum has acoustic oscillations, the derivative term oscillates in with a phase shift com- 
pared to the power spectrum. For li • 1 « (i.e. /a = ^0{l\ll-2)^\) the derivative term is small and the bispectrum 
is generally of much smaller amplitude and has the same phase of acoustic oscillations as the power spectrum. The 
phase shift of the dominant lensing bispectrum signal compared to the phase of the acoustic oscillations is rather 
distinctive, and different from that expected for any primordial bispectrum of adiabatic perturbations. The strong 
scale-dependence (very little signal for Ix > 100) is also different from standard local non-Gaussianity models; see 
Figs. H and ||. However as we shall see the isotropic part of the lensing bispectrum signal does have significant overlap 
with the local /nl model, so although it is easily distinguished it is also important to model when studying local 
primordial non-Gaussianity. 
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I 

FIG. 5: The CMB temperature lensing reduced bispectrum b4,i^i+4 using the squeezed-hmit approximation of Eq. (^) with 
the unlensed (thin blue dot-dashed) and lensed (thin blue solid) small-scale power spectrum, compared to the full spherical 
result of Eq. p3tj using the lensed small-scale spectrum (thick red). For comparison the thick black line shows the result for 
a local-model primordial bispectrum with /nl = 10 (which has not itself been lensed here; see Ref. |^); note the difference in 
phase and scale-dependence. 



Wc can also derive the squeezed limit following an argument similar to Refs. |T^-^ by considering one fixed very 
large-scale lensing mode of the magnification matrix A, where 

dct ■ 

^rj = + ^ = (1 - k)S,, - 7,,. (16) 

Here, k is the convergence and jij is the symmetric, trace-free shear. Since for a local displacement C we have 
T(C) = T{A(), it follows that taking the average with fixed A we have 



™mi3)>=q7-.,i^^ 

Expanding to first order in the convergence k and shear matrix 7 gives |^ 

, dln(^iC,7) 



{T{h)T{h)) = Q7<5(l2 + I3) 



dln^. 



(17) 



(18) 



If there are also small-scale lensing modes then approximately the same result is obtained, with the power spectra 
replaced by the lensed CMB power spectra, in agreement with Eq. (|l^) when correlated with T(li). Equation(|T^) 
makes clear the different shape dependence of the convergence and shear effects: a scale-invariant spectrum looks the 
same under uniform magnification, but shear introduces observable distortion to the hot and cold spots (only the 7 
term contributes if I'^Cf^ = const.); a white-noise spectrum looks the same after shearing, but the noise amplitude 
is changed under magnification (only the k term contributes if Cf" ~ const. )^. 

IV. GENERAL FULL-SKY CMB LENSING BISPECTRA 

We now present the generalization of the lensing bispectrum calculation of the previous section to the full-sky 
and polarization. Further details are contained in Appendix Following Ref. the lensed field is given by 



^ The squeezed-limit form of the bispectrum here disagrees with Ref. pif which has incorrect factors in the anisotropic term. The 
result given in Ref. ps) is in agreement in the matter-dominated Sachs- Wolfe limit. 
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FIG. 6: Contour plot of hhlaih + h + I3)biii2i3 (with non-linear intervals) for ISW-lensing (left) and local-model primordial 
non-Gaussianity (right; different overall scale). Although both are peaked for squeezed configurations, there are large phase 
and shape differences. 
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(20) 
(21) 

(22) 



Note that FT^f , is only non-zero for I1 + I2 + I3 even and is then symmetric in I2 and I3: F, f , is only non-zero for 
^1+^2 + ^3 odd and is then antisymmetric in I2 and ^3. We shall assume there are no unlensed B modes, so that Bim 
is due entirely to lensing. The non-perturbative flat-sky derivation generalizes directly to the curved-sky case; we 
implement the result here by simply using the lowest-order series-expansion result and then replacing the unlensed 
power spectra with their lensed counterparts. The leading-order lensing-induced 3-point function, using the lensed 
power spectra for the small scales to reproduce accurately the non-perturbative calculation, is then given by 



5 perms. , 



(23) 



where se = s_b =2 and st = 0, and E = —B, B ^ E and T = 0. Equation 
permutations of i{lm). The bispectrum then follows from 



h h h 
mi 7712 fTT-s , 

3D includes a sum over all six 



B 



ijk 



h h h 



E > 



('^il771i'^^277l2"'3 77l3)- 



(24) 



Note that under interchange of a pair of arguments, e.g. ili O jl2, the bispectrum changes by a factor (— i)'i+'2+'3^ 
If the bispectrum involves a parity-odd combination of fields, e.g. {TBE), parity-invariance in the mean requires non- 
zero bispectra to have + ^2 + '3 odd and hence to change sign under interchange of a pair of arguments. Furthermore, 
non-zero bispectra with li + ?2 + '3 odd are necessarily imaginary. 
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Results for the polarization bispectra have been derived before (e.g. Ref. flgl): however previous calculations have 
invariably set the large-scale Cf^ = 0, missing a signal detectable at several sigma with cosmic- variance limited data, 
and the power spectra have usually been the unlensed ones, giving a systematic error of 0(10%). We show several 
slices through the temperature and polarization bispectra in Figs. 0, & ^ using both analytical calculations as well as 
simulations using the Monte-Carlo procedure outlined in Appendix O, testing the accuracy of the unlensed short-leg 
approximation and the use of lensed power spectra rather than e.g. Cf^^. We demonstrate in Appendix ^ that for 
the polarization case, the non-perturbativc calculation involves a new spectrum Cf^-^ as well as Cf^^ and CP'^^ . 
Terms involving this spectrum are missed in the approximation of replacing e.g. C^'^^ by C^^. However, this is 
harmless since Cf^^ is of similar magnitude to C^^ and the error from neglecting such terms is small compared to 
the change in C^^ due to lensing (which is the dominant correction to the leading-order bispectra). 

For the temperature, the reduced bispcctrum bi-^i^i^ is defined so that 

rTTT / (2/i + 1)(2Z2 + 1)(2/3 + 1) ^/i h h\, 

BiM, = \j (^0 oj^'^'^'- ^^^> 

where bi-^i^i.^ is taken to be zero for Zi -|- /2 -I- I3 odd. This generalizes straightforwardly to bispectra involving only 
T and/or E since parity- invariance forces the bispectra to vanish for li -\~ I2 + I3 odd. However, there docs not 
appear to be a standard equivalent definition for the bispectra involving a product of fields with net odd parity. 
For sufficiently sensitive data, these "odd-parity" bispectra are well measured because of the expected absence of 
small-scale primordial i3-modes; this is equivalent to the lensing reconstruction from E~B correlations having lowest 
statistical noise, and hence correlating well with the large-scale temperature (and polarization). Since gravity waves 
decay on sub-horizon scales, the squeezed "odd-parity" CMB lensing bispectra are yet another way in which the CMB 
bispectra are very different from any primordial source. With sufficiently low noise, the large-scale lensing potential 
can be reconstructed very well using the small-scale E and B polarization ||l^, , so it would be straightforward to 
project the correlated component out of the temperature and polarization data and thereby remove CMB lensing as 
a source of contamination for other signals. 



V. ESTIMATORS, VARIANCE AND BIAS 



We now turn to a discussion of optimal estimators for the lensing bispcctrum and their variance. As reviewed in 
Appendix ^ in the case of an isotropic survey (full sky and uniform noise) the optimal estimator for the amplitude 
of a bispcctrum template B is, for small signals, given by 



S=T; J2 VeCp(B,^;^;3)tC0V;J^,3VeCp(B;^i^;3) = 



li<l2<h 



6F ^ 

hhh 



-lyp 



'l\kr ftpqr 



(26) 



where vecpi^i^i^i^) is the vector of distinct elements of B^^^^j^ with covariance matrix Covj^f^iai C'tot'i is the total 
cross-power spectrum including noise, and F^^ ~ (S*^) is the inverse of the Fisher error in the limit of no non- 
Gaussianity, given by 



P— 1 /rjyfc \*(r'~l\ip(r'-l\jq(/^-l\kr npqr 

Q ^ hhls' y'~'tot)li\^tot)l2\'~-'tot)h -^hl2l 



hhh 



E 

h<h<h 



^(ii\i3('^/7ti3)*(^tot)if (C'tot)0(C'tot)zr^f'^^ (27) 



Here Ai^j^;., = 6d^^]^Sl^5'^^^ (with no implicit sums over Z-labels): Ai-^i^i^ is 6 if Zi = ^2 = ^3, 2 if two of the indices are 
equal, and 1 otherwise. 

The Fisher error in Eq. ( p7| ) was calculated for Gaussian a^^^, but in the presence of lensing the variance is 
necessarily larger since there is a guaranteed signal and this itself has some variance. We will motivate an expression 
for this increase in variance by recasting the estimator for the lensing bispectrum as a cross-correlation between a 
quadratic estimate of the lensing potential and the CMB itself. We begin for simplicity in Section V A by considering 
the temperature-only case. Polarization is a straightforward generalization and is presented in Section VB. Then 
in Section V C we combine these results to determine the significance with which the lensing bispcctrum may be 
detected. In Section VD we generalize our results further, to the case where the lensing bispectrum is used in a joint 
analysis with other bispectra, as a source of bias to be subtracted or marginalized over. 
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FIG. 7: The reduced bispectra h^^^j^^ for various temperature and i?-polarization combinations. Left: Thin sohd (red) hues 
show the lowest-order result for the CMB lensing bispectrum; thick solid (blue) show the approximate non-perturbative result 
using the lensed CMB power spectrum. Dash-dotted (black) lines for comparison show the result for a local-model primordial 
bispectrum with /nl = 30. Right: The difference between the non-perturbative bispectrum and the lowest-order result, in units 
of the maximum of the absolute value of each bispectrum. Smooth (red) lines show the theoretical approximation of Eq. (p3[); 
noisy (blue) lines are results from 1000 simulations, smoothed over A/ = 10. 



Temperature 



Equation ( p6| ) for the amplitude of the lensing bispectrum from temperature data can be rewritten as 

\-ll<l2<l3 



5=1^ Tli™ 



F 



hmi tot 'i 



h h h 



mi 7712 mgy (jTT ^TT 

I2I3 rn2rn-3 ^ ^ tot ''2 tot 



(28) 



The term in square brackets is (proportional to) a quadratic estimator for the lensing potential V'/^mi F^ - To see 
this, consider taking the expectation value of this term over noise, small-scale modes of the unlensed temperature. 
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FIG. 8: Same as Fig. ^ but now showing fesoa+so' "^here results on the right are from 700 simulations and are unsmoothed. 



and the modes of the lensing potential with (Im) ^ (ZiTOi). We have that 



El h h ^3 \ rfi \ ( ^1 ^2 ^3 

I m, mo mn ; \ '2m2-'J3m3/(im)/(iimi) 



TOl 7712 1TL3 



E 

ATT 

2h + 1 ' 



TOl TO2 nisj \ S^p* 



limi ' 



/ 1 m 1 



Tl2m2 Tl 



(29) 



which is correct to first order in modes of ?/' at li and non-pcrturbativcly correct in its other modes. Here, 



'-iTT rpO 



(30) 



which is related to the squeezed limit of the lensing bispectrum by i^Ziiaig = ■^m^- foUows that there is a 
quadratic estimator for i^i^m^ of the form 



A^"^ ( ^ Ti2m2Tl3m3 

h hhh hhls \ m, rn-^ m-i I r<TT. nTT. 



(31) 
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FIG. 9: The "odd-parity" bispectra -^(-l)'^!^, j+g (left) and -^(-l)'^^ ;,;+49 (right) comparing the theoretical result of 
Eq. (|23[) (smooth line; red) to the result (noisy hue; blue) of 700 (left) and 150 (right) simulations. 



where 



(O)i-l 



1 



2^1 + 1 



li<U<h 

E 



A 



hh 



h hh i^TT riTT 
^tot '2 '-'tot ^3 



(32) 



This estimator satisfies {ijJiimi) (ivi}^{iimi) = i'hmi to first-order in iphmi but is non-perturbatively correct in the I ^ li 
modes of ip. It is a non-perturbative version of the usual quadratic estimator avoiding the low-^ (W'^^') bias in 
the standard estimator that was identified by Ref. and generalizing the perturbative corrections of Ref. [|l8| to a 
non-perturbative form by simply using the lensed small-scale power spectra in the filter functions. In the Gaussian 
limit, the variance of the estimator is simply N^^'' and the weighting in I2 and I3 in Eq. ( |3l] ) can be shown to minimise 
this Gaussian variance subject to the estimator being unbiased. 

We can now rewrite the estimator of the bispectrum amplitude in terms of the ip reconstruction as 



S 



p E '^h 



imi 



/imi 



r'TT 



tot h N. 



(0) ' 



(33) 
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and the normalization 

F « ^(2/, + 1) {cr^y {cn^y (<)" . 04) 

Recasting the bispectrum estimator in this form leads directly to an understanding of the contribution to the error 
from signal variance. The estimator 5' depends on the empirical cross-power Cf^^ between the "0 reconstruction and 
the large-scale observed temperature: 

^^^^ ^ ^n^Y.^h^^^^t.m. ^ (35) 

^ mi 

Since {Cj^) — Cj^^ , each 5*;^ is an unbiased estimate of the bispectrum amplitude. As with any other power spectrum 
estimator, has uncertainty both from reconstruction noise and from signal/cosmic variance: 



var CT^ 

11 



1 



2/i + l 



(36) 



1 + rr^ 1 



where n = Cj^ / \J C'^^ iCf^' and the usual zero-signal variance term is 1/Fi = C'[^ iNf^ /[{2l + \){Cj'^'y'] which 

comes from the term involving A^j^^-* in Eq. (^6|). In the standard estimator, Eq. (|3^), the Si^ are weighted with the Fi^ 
and the normalisation is accordingly F ~ Fi-^. In the presence of a non-zero signal, we can reduce the variance by 
weighting the 5*;^ with the full inverse variance. This defines a lower- variance estimator for the bispectrum amplitude, 

(38) 




(39) 

When Fi-^ is large, so that the lensing modes arc reconstructed with a small error, the contribution of the signal 
variance terms l-t-r^"^ become important, ensuring that the total signal-to- noise never exceeds that expected from the 
cosmic-variance limit on the cross-correlation. Neglect of the signal variance term would lead to an overestimation of 
the significance for a detection of the lensing bispectrum (a similar effect happens with primordial non-Gaussianities'* 

. . . . rrr . 

The optimal Fisher variance in Eq. ( p9[ ) can easily be related to that for an optimal measurement of the cross- 
correlation, giving 

^-T. m 



i+c's..(cr+<')/(c) 

d from a: 

using the estimator iphmi for the lensing potential 



This is exactly the same result as obtained from an optimal estimator of the amplitude of the cross-correlation Cf^ 



25l use a single 
or lensing here 



* To account for the signal variance we have used an -dependent weighting in Eqs. (H) and (|39[); Creminelli et al. 
realization-dependent change to the overall estimator normalization, which should beless optimal. The argument 
can straightforwardly be generalized for estimation nf local non-Gaussianity, using a quadratic estimator for the small-scale primordial 
power modulation rather than the lensing potential |26 1 ; the corresponding estimator may be a fast nearly-optimal alternative to a fully 
Bayesian method p?) if the non-Gaussianity were large. 
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Note that here we have only discussed the optimal estimator from the measured cross-correlation (bispectrum). 
If auto-spectra (power spectrum and lensing trispectrum) are also included the variance can be reduced further^; 
however since the correlation r is always r < 0.5, and r < 0.2 where the signal-to-noise peaks even with no noise, the 
gain from a more optimal joint estimator is rather modest, being 0{r). We do not discuss joint estimators further 
here, but a likelihood analysis of actual data should of course properly account for the full covariance structure of the 
estimators used. 



B. Polarization 



The arguments above carry over quite directly to polarization. The original estimator for the bispectrum amplitude, 
Eq. (p6|), can be written as 

s-jH <''(A-oi):x™. (iv/°))"'^r.™„ (41) 

/i mi 

which involves the quadratic estimator 

^hmi^^h X/ ^hhh^'^hhhy X] (mi 7712 77^3 ) ^^tot ^l^^i^ima (^tot )f3''^/3m3 ' (^2) 
hl3 m2m3 ^ ' 

where the normalisation 



The overall normalization of S can be rewritten as 

F 



Here, in the approximations of Sec. IV 



A 



jk 



^3^1 ^2 



(44) 



(45) 



and is related to the squeezed limit of the bispectrum by B^^l^f^i^ = C^_^^ Aj'^i^i^. As for temperature, the quadratic 

estimator ^i-^m-^ is a non-perturbative generalization of those constructed in [|l^. Averaging over small-scale unlensed 
CMB modes and the modes of ip with (Im) ^ (limi) returns ipi-^mi to first-order since 



E 



^1 h h 
mi 77i2 



h h I3 



'i'2'3 ;* 



A' 

I1I2I3 ^/,* 



(46) 



The weighting in Eq. ( p2| ) minimises the Gaussian variance which is simply N^^"^ . 
The data enters the estimator, Eq. (41), through the empirical cross spectrum. 



(47) 



^ We thank the referee for raising this point. 



16 



As with temperature, there is a signal contribution to the (co)variance, since 



( ^a'- ill . i^ip 
y-'h + '-'tot/l'-'ii 

{2h + i)c;;^c°"^ 

\ l\ li tot^i li 

(2/i + i)cf;^cf;^ 



(2/i + i)cf;'^cf;'^ 



(48) 



The Gaussian contribution, (i^^^ ^)*p, to the covariance can be rewritten as the inverse of 



li<l-2<h 



^= E ^Xi.i^.iJ^^Z^^r^iC^lSfl,^ (no sum on z or p). 



(49) 



The original bispectrum estimator S = F^^ Tlirip ^h^h ^^^'^ ^he normalization can be written F = J^hip ^h- The 
Sf^ are weighted by their zero-signal inverse covariance; we can improve on this by using the inverse of the full 
Cov(S';'^, Sf_^). The resulting estimator has variance J-~^, where 



hip 



E 



iF-yP + 



li h 



{2h + l)Cf^'^Cl"^ 

where the matrix inverse is taken of the term in square brackets in the second line. 



(50) 



C. Detection significance 

Collecting our results for temperature and polarization, in Fig. ^ we plot the expected detection significance of 
the CMB lensing bispectrum as a function of the maximum observed multipole Zmax, assuming noise- free data and 
Planck-like noise levels. Here, we have only included the T and i?-polarization bispectra. Including _B-mode spectra is 
not expected to improve the variance significantly for large ^max^ despite the E-B estimator being the most powerful 
for reconstructing the lensing potential in the absence of noise using only noise-free T and E the statistical 
noise in the reconstruction is already below cosmic variance on large scales. The dotted and solid lines in Fig. |l^ 
give the results using the zero-signal Fisher error and the result including the additional cosmic variance due to the 
signal, respectively. The detection significance is bounded by the cosmic- variance limit on the detection of the lensing- 
potential cross-correlation power spectra. Planck should see the lensing bispectrum at about 5(7, while a zero- noise 
experiment should get very close to the cosmic variance limit of about 9a using temperature and polarization. 

Figure pi] shows the contributions to the signal-to-noise as a function of li, the largest-scale mode. The distinctly 
different dependence from the local /nl contributions is clear. The signal-to-noise peaks for li ^ 20, which is a 
trade-off between the C'f^ and Cf^' contributions falling rapidly with I, and the small number of modes at low li. 



D. Joint analysis of different bispectra 



The primary objective of non-Gaussianity searches is to look for primordial non-Gaussianity. which is expected to 
be small in many models. If we have a template primordial bispectrum B, the optimal estimator for small levels of 
non-Gaussianity would obtain a mean contribution due to CMB lensing given by 

,f . ^o(B'-,B) 

where the Fisher matrix for bispectra with zero expected signal is given by ^ 

pa, ^ ^(B^B'') = \ {Bt;^^{C^{C^^{C^:i^^^. (52) 
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FIG. 10: Fisher 'detection significance' (defined by 1/a — J-^^^ , Eq. (^o|)) of tlie CMB lensing bispectrum as a function of /max 
for no instrument noise and for Planck-like noise levels using just the temperature bispectrum (black for no noise; magenta for 
Planck) or using all the T and i?-polarization bispectra (blue for no noise; red for Planck). The dotted lines show the (incorrect) 
results obtained if the signal contribution to the variance is neglected: all results are bounded by the cosmic-variance detection 
limit on a measurement of the low-/ cross-correlation spectra Cf'^ and Cf^ . If the cross-correlation part of the signal variance 
is neglected (as for a null hypothesis test), the 'significance' is < 5% larger. 
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TABLE I: Errors and biases on CMB lensing and primordial local-model non-Gaussianity parameterized by /nl for Planck-like 
noise (assuming isotropic coverage over the full sky with sensitivity AT — AQ/2 — AU/2 = 50/iKarcmin [A^;"^ — N^^^/A = 
2 X 10^'* /iK^] and a beam FWHM of 7arcmin) or cosmic- variance limited data with /max = 2000. From Eq. (|6^) the errors 
cr/jjL s-nd cTicns are the errors on the amplitudes of the corresponding bispectrum templates individually when the other one is 
fixed; CT/J^^L^*^ is the Fisher error on /nl if the amplitude of the lensing contribution is marginalized over; and the correlation is 
that between the two bispectrum shapes. The bias is the systematic error on /nl if the CMB lensing contribution is neglected, 
i.e. Eq. d]). 

This lensing bispectrum eontribution to the primordial signal estimator must be carefully distinguished in order not to 
obtain a spurious detection. Whether the bias is large or not depends on how similar the shapes of the two bispectra 
are. Since the CMB lensing bispectrum is mostly in squeezed triangles, the bias is expected to be most significant 
for nearly-local forms of primordial non-Gaussianity: there is an expected bias of (/nl) ions = 9 using cosmic-variance 
limited temperature data to /„iax ~ 2000 Provided that the underlying cosmology is well understood, this bias 
may simply be subtracted. Even from current WMAP constraints, Cf^' is constrained to 0(10%) if we assume a 
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FIG. 11: Contributions (per In/i) to the Fisher inverse variance as a function of assuming no noise (top) and Planck-like noise 
(bottom). Blue lines are for the lensing bispectrum using only T (dashed) and T and iJ-polarization (solid). For comparison, 
the contributions to the inverse variance of the local-model bispectrum with /nl = 30 are shown in red. For such a large 
/nl (chosen for clarity in the plot), signal- variance contributions might be important but are not included here; however, the 
lensing results do include signal variance (in addition to the zero-signal term in the y-axis label). 



flat, ACDM cosmology with massive neutrinos". As discussed in Refs. ||^, |30| the signal variance of the CMB lensing 
bispectrum will also slightly increase the variance of other non-Gaussianity estimators, even if we assume that the 
shape and amplitude of the lensing bispectrum are perfectly known. Suppose that one obtains an unbiased estimate 
of some other bispectrum B by subtracting out the expected lensing contribution. If the estimator is not weighted 
accounting for the signal variance from lensing, i.e. constructed simply using Eq. (^6|), then the variance of the 
bispectrum amplitude, /nl, for squeezed shapes is 

r^TT (p^^ _|_ l(pT^\2 

var/NL = i VF-«) +i>:^^ri y'AFrf^^^^^^^-^^^ (53) 




2h + l 



where 



{h<l2<h) 

The first term in Eq. (|53| ) is the usual zero-signal variance while the second term gives an additive contribution from 
CMB lensing. For cosmic-variance limited temperature data to ^max = 2000, the analytical increase in the local /nl 



More rigorously, the standard deviation for independent samples of Cj from a MCMC exploration of the WMAP-f SZ-I-LENS seven- 
year data likelihood using COSMOMC psj] is ~ 10% for I < 200. The primary uncertainty is in the amplitude of the correlation rather 
than the shape. 
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variance is from 17.2 to 19.4, in good agreement with the resuh found by direct simulation in Ref. corresponding 
to a ~ 5% increase in the /nl error bar. For Planck the increase is smaller, ~ 1%. CMB lensing also affects the 
primordial bispectrum itself, by smoothing out the acoustic peaks, which would give some additional correction to 
the expected constraint that we have neglected here, but which is discussed in Ref. |^ . 

If instead we re- weight the bispectrum estimator in /i to suppress the cosmic variance, then for squeezed shapes 
the modified variance will be given by T]^ 



where 



^BB 




2 ~ 



h 



2/1 + 1 



(55) 



For cosmic-variance limited temperature data to /max = 2000, the optimized /nl variance variance is 18.6, rather 
smaller than the sub-optimal value above. However even with the optimized weighting, lensing still degrades the 
constraint since the optimized weighting is effectively decreasing the amount information available in the largest-scale 
modes. 

As in the temperature case, CMB lensing also contributes to the variance of other non-Gaussianity estimators for 
polarization, where Eq. (Fsj) generalizes to 



var/NL = var/NLU=o + (var/NL|v'=o) ^ ^'(^ 
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Here, 
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(57) 



generalizes the Gaussian inverse covariance of Eq. (^) to analyses of joint bispcctra, with an A component making 

the replacement B^i'lf^i^ ~^ -^hhh' "^^^ fractional increase in the /nl error bar is similar to the temperature case. 

As the detailed shape of the lensing bispectrum is quite different from that of primordial non-Gaussianity, the 
correlation between the two estimators is small and the amplitudes of the lensing and primordial bispcctra may also 
be fit for simultaneously, rather than simply assuming a fiducial lensing bispectrum. This is a safer method to correct 
for the lensing bispectrum, although it may result in slightly larger error bars (one could view this as a trade-off 
between systematic and experimental errors). If the amplitudes of multiple bispectra are estimated simultaneously, 
then the optimal estimator for the amplitude of B" with the first field at scale h and of type h can be written (in the 
approximation in which the inverse- variance filtering is given by the lensed power spectrum) as 



{ii<h<h) 

cah _ rp-llah.bi A -1 ( f>b,ijk \* r^,-l-iip r^-lpq r^-l-ikr npqr 
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(58) 



and the covariance will have contributions from the lensing variance 



Cov(5,7,^ff) = [i^^-ij-.fJ + j^.i™*^^' 



ra, lens c6, lens 



(2/1 + i)cr;^cf;'^ 



(59) 



in agreement with the previous result when just estimating the lensing bispectrum amplitude. Combining the infor- 
mation in all li and in all the fields, the overall optimal joint estimators therefore have Fisher matrix 



F' 



ah ^ ^ I — ljaz,6j ^a, lens lens 



(2/i+i)cr/c,t 



(60) 



where the inversion is of a large matrix with lumped indices ai and bj. This can be used to calculate the correlation 
between estimators, and assess the errors on individual bispectrum amplitudes with and without marginalization over 
uncertainty in the amplitudes of the other bispectra: 1 / F'^'^ gives the variance on bispectrum _B° if the other bispectra 
are fixed (known amplitude); [F~^]°'°' gives the variance if the other bispectrum amplitudes are marginalized over. In 

practice Eq. (|60|) involves inversion of singular matrices, and should actually be evaluated using matrices with C° ^ 
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factored out; lensing alters the error on other forms of non-Gaussianity even when C" (and hence the corresponding 
lensing bispectrum) is zero. 

With only temperature data the correlation between primordial local non-Gaussianity an d lensing is ~ 0.22 for 
Planck, which means that the error on /nl increases only by a factor y^[F~^^^ /[F^^]~^ ~ 2% if the amplitude 
of the lensing bispectrum is marginalized over (assuming the template is fixed). If polarization data is included the 
bispectrum shapes are even more different, so the correlation is even smaller. Table | contains a summary of results 
for cosmic- variance limited and Planck data, using temperature-only or temperature and iJ-polarization. 



VI. CONCLUSIONS 



Wc can summarize our current understanding of the lensing bispectra as follows. 

1. The late-time integrated Sachs- Wolfe effect gives a significant CMB temperature bispectrum. The signal is 
mostly in squeezed triangles, and strong enough to bias estimators of local non-Gaussianity if unaccounted for. 
The signal is thought to be well constrained even with current data, however, and also has a distinctive shape 
and scale dependence compared to that expected from any adiabatic primordial non-Gaussianity model, which 
allows the two bispectra also to be separated in data. 

2. The iJ-polarization from reionization is correlated with the lensing potential and gives a contribution to the CMB 
polarization bispectra. This is in principle detectable at ~ 2.5a, and, like the temperature lensing bispectrum, 
can bias polarization estimators for primordial non-Gaussianity if not accounted for. The signal is mostly on 
very large scales, falling rapidly with decreasing size of the largest mode. 

3. The leading-order perturbative calculation for the CMB lensing bispectrum is inaccurate at the ^ 10% level. 
There is a simple non-perturbative approximate calculation that agrees well with simulations, and is will ap- 
proximated by using lensed rather than unlensed small-scale power spectra in the standard lowest-order result. 

4. The covariance of the CMB lensing bispectrum has significant contributions from the signal variance, which is 
easily understood by thinking of the bispectrum as the empirical cross-correlation of the temperature with a 
quadratic estimator for the lensing potential. The detection significance of the lensing bispectra is limited by 
the cosmic- variance limit on the detection of the cross-correlation power spectra Cf'^ and Cf^, about 9(T in 
total. 

We have developed a numerical code to calculate the CMB lensing potential cross-correlation power spectra and the 
lensing and local-model primordial bispectra, which is publicly available as part of CAMB^. The public LensPix^ 
code can be used to simulate the properly correlated unlensed fields as well as the lensed CMB temperature and 
polarization. 

Note that although we have focused in this work on a CMB-only analysis of the lensing temperature and polarization 
bispectra, it is also possible to form bispectra involving mixtures of the CMB and other observables, for example B^^^ 
where A is some tracer of large-scale structure. CMB-large-scale-structure bispectra have already been detected, giving 
the first signatures of the CMB lensing effect |3^ , and a joint analysis with the other bispectra can slightly improve 
constraints on primordial non-Gaussianity pSL If A is only used on relatively large scales and can be treated as a 
Gaussian field, then our previous results immediately apply to these other forms of bispectra, for example on the flat 
sky the combination of Eq. (0) and Eq. (|o|) becomes 



(A(li)f(l2)f(l3)> 
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(61) 



Thus non-perturbative corrections can also be important for the cross-correlation bispectra. As mentioned in the 
context of Eq. (^T]), the corrections can easily be incorporated into the quadratic estimator framework for lens 
reconstruction by suitably substituting C"^^^ for in the weighting of the observed temperature flelds and the 
normalisation of the estimator. As shown in Ref. by a direct calculation, this modification to the estimator also 
improves the bias properties of the estimated lensing power spectrum on large scales. 
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We have focussed here on the hnear contributions to the CMB-lensing potential correlation. Future work should 
also incorporate an accurate model of Rees-Sciama and SZ contributions |3j, where the 'unlensed short- 
leg' approximation may be less accurate. In addition to CMB lensing there are also other non-linear effects at 
recombination that can give rise to non-Gaussianity even in the absence of a primordial signal [^ ; future precision 
constraints on primordial non-Gaussianity should also model these, and there may also be some overlap between 
the shape of non-Gaussianities at recombination and CMB lensing (for example modulation of the sound horizon at 
recombination by large-scale modes looks rather similar to lensing magnification of the last-scattering surface). 
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Appendix A: E—tp correlation from reionization 

For the calculations in the main text we compute the E-ip correlation accurately using CAMB |l4|. However, 
to build intuition, we give here a simple analytic treatment under the assumption that scattering at reionization is 
instantaneous. 

Consider an electron at reionization in a direction fi; its position relative to the origin (the observation point) is 
X = XroA where Xro ^ 10000 Mpc is the comoving distance to reionization. Linear polarization is generated at reion- 
ization by scattering of the local temperature quadrupole there. Reionization occurs well before dark energy becomes 
dynamically relevant and so the temperature quadrupole at reionization is simply a projection of the perturbations on 
the electron's last-scattering surface (which has a comoving radius Ax = x* — Xrc ~ 4200 Mpc, where x* ~ 14200 Mpc 
is the radius of our last scattering surface p^). The modes that contribute most to the quadrupole at reionization 
have wavenumber k « 2/4200 Mpc~^ and are therefore well outside the sound horizon at last scattering (where the 
conformal age is 264 Mpc). For adiabatic perturbations, the temperature quadrupole at reionization is therefore well 
approximated by the simple Sachs- Wolfe result, giving 

e2™(xrefi; vre) = " y / v*)e''' ''-%{kAx)Y*M, (Al) 

where the (dimensionless) temperature anisotropy for radiation propagating in direction e at position x and conformal 
time 77 is 6(77, x,e) = J2im 0im(x; r7)Yirri(e), and ^I' is the gravitational potential. The polarization that we observe 
from reionization along the line of sight n is 

iQ±iU){h) « -^^e2™(Xrcri;?7rc)±2l2m(fl), (A2) 



where r is the optical depth to reionization. Using Eq. (Al), we have 



{Q±^U)in)^^^ j ^^^vl/(k;,7,)e»''>^-"j2(fcAx)5^y;„(k)±2>2™(A) 



(A3) 



This is a pure E-modc signal, where^ 

(Q ± iU){h) = Y.^Ei^ ± iBi^)±2Yim{n), (A4) 



Our polarization conventions throughout this paper follow so the E and B multipoles have opposite sign to |40| and to the output 
of CMBFAST pl| and CAMB |l^ . The E—ip correlation we compute here therefore has opposite sign to that displayed in Fig. 
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with i?-modc multipoles 



'G + 2)!ji(fcXre) 



>^;™(k). (A5) 



{I - 2)! (fcXro) 

In the flat universe we are assuming here, the lensing potential of Eq. (^) has muhipoles 

^im = -Sm' J dx ^70 - xMkx)^ Y^M, (A6) 

so the E—tjj power spectrum is given by 



where the unequal-time power spectrum of ^ is defined by 

(vl/(k;77)vl/*(k';r/)) - |^7'*(fc; r;, r?')5(k - k'). (A8) 

To make further analytic progress, we neglect the late-time evolution of the gravitational potential which is a reasonable 
approximation since CMB lensing is most efficient around z sa 2 and, furthermore, low-redshift lenses will be only 
weakly correlated with the polarization from reionization. We therefore have 7^*(fc; 77*, 770 ^ x) ~ and, 
on the (large) scales of interest, this is directly related to the power spectrum, V-jzik), of the primordial curvature 
perturbation and so is nearly scale-invariant. Defining the integral 

^ / dlnkMkAxY^I^ ( r dx^^jiikx)] , (A9) 

J [kxror \Jo X*X J 

we can approximate 




(AlO) 

where V^i{k]r}^,) = %Vn{k) / 'i.b ~ 9^^/25. The integral Ii can be (very) roughly approximated by Limber- 
approximating (e.g. in the integral over x then again in the integral over k: 

Ii-^^^^^^MlAx/xro). (All) 
zt X* 

This approximation is only good to around 50% (away from zero crossings) but does capture the basic shape of the 
Cf^ correlation in Fig. |[ In particular, we expect [l{l + l)]^^^Cf^ cx j2{lAx/Xic) which gives rise to the oscillations 
seen in Fig. ^. At a given I ^ 2, most of the correlation is from wavenumbcrs k ^ l/x^c', the angular projection of 
plane-wave fluctuations in at wavenumber k over the electron's last scattering surface gives a local temperature 
quadrupole going as j2(fcAx). For / = 2 where the signal is largest, the dominant contributions arc actually from 
k ^ 4/xre as shown in Fig. g. 

On large scales, Cf'^ > (< in the CMBFASt/camb conventions adopted in Fig. ||), corresponding to radial 
polarization around large overdense lenses. The part of the T-E correlation from reionization coming from temperature 
anisotropics sourced at last scattering has the opposite sign to C^^^ on large scales since large-scale overdensities on 
the last-scattering surface appear cold. However, the contribution of the late-time ISW effect to the T-E correlation 
from reionization is positive on large scales since the decay of large-scale potential wells at late times produces positive 
temperature fluctuations. 
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Appendix B: Perturbative temperature lensing bispectrum on the flat sky 



Expanding r(x) — T{x. + V'i/') to third order in tjj we have 
d^li d^L 



T(l) = T(l) - 



27r 27r 
1 /■ d^li d^la d^L 



T(L)?/'(li)li • L(2^)5(li + L - 1) 

r(L)^(li)V'(l2)li • LI2 • L(27r)^(li + I2 + L - 1) 



2tt 2tt 2tt 



1 /■ d^li d2l2 d^la d^L 



6 



27r 2tt 2tt 2-k 



T(L)^(li)V'(l2)V'(l3)li • LI2 • LI3 • L(2^),5(li + I2 + I3 + L - 1). (Bl) 



For l\<li< I3 and li ^ ^3 (assuming Cf'^ w for large / ~ hJs, and T(li) = T(li)) to third order in ip we 
have 



Here we have defined 



Tip 



\,-hci:'{i-iiR^')+ 

- ilR"") + 



d^L 



li.LCi^C[i7_L|[(l2-L).L] 
(2.)2^^rC,t%[(l.-L).L]^ 



(27r)2 
d^L lo-L 



(I2 ^ I3). 

- (I2 o I3) 



(B2) 
(B3) 

(B4) 



which is half the total deflection angle power. The term in square brackets in Eq. (83) is just the second-order result 
for Cf^^"^ . If we approximate li • L « li • I2 (corresponding to a 'large-lens' approximation, where Pcf^ falls rapidly 
at high I) and use the second-order result for the lensed power spectrum l]!^, 



cr « (1 - + 



1-1' r 



Eq. (B3) is then equivalent to the first-order result but using the lensed power spectrum: 



(B5) 



(B6) 



In Eq. (B3) (and the main text) we assumed that r(li) = T(li). If we relax this approximation there are additional 
third-order terms. Keeping only terms involving Cj^ at low I where it is non-zero, there are two contributions: from 
0{ip) X 0{ip) X 0{il^) and 0{if) x ©(V-^) x 0(1), 



(r(ii)r(i2)T(i3)) « (r(ii)T(i2)T(i3)) - —5{h + h + h 

ZTT 



d^L 



|[(li - L) • L][(l2 + L) • L][(L - li) • (L + h)]C^J+^ - [(li - L) • L][l2 • L][(L - h) ■ yqT.T} + (I2 ^ I3), (B7) 

and an additional small term from 0{ip'^) x 0{t/j) x 0(1) which is down by a power of li/l2- Individually the separate 
terms in Eq. (B7) arc significant (if not large), but since L is small, for large II2I we have I2 -|- L w I2, and the terms 
nearly cancel. Using r(li) = T(li) is therefore a good approximation, as expected on physical grounds for small li 
and verified with simulations in the main text. Corrections from Eq. (B7) are fractionally most important for triangles 
where li is orthogonal to I2, where the signal is small anyway, and remain sub-dominant to the correction obtained 
by using the lensed rather than unlensed power spectrum. 



Appendix C: Calculation of the gradient power spectra 

In this appendix, we calculate the power spectrum of the TVT correlation, and the equivalent results for the 
polarization. We will assume here that we can neglect T-ip correlations for this calculation. As we only require Cf^^ 
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on small scales where the ISW contribution is small this should be a reliable approximation. We start with the flat-sky 
limit. Following ||ll|, Q we construct the correlation function as a function of r = x — x': 

X(r) = r • (VT(x)f (x')) = r • ([VT](x + a)T(x' + a')> 

= f il^^(e'Hx+«)g-l'.(x'+«')^)^(J^..l)y(l)y(l/)*^, 

27r 27r 



(27r) 

f^e-'^-^M/^l E [^2„-iar)-J2„+iar)]/„[;2Cg,2(r)/2]. (CI) 



n— — oo 



Here, we have defined 4> as the angle between 1 and r, (T^(r) = ((a— a')^) /2 as half the variance of the relative deflection, 
and Cg\^2{'r) as the non-isotropic part of the correlation function of a: Cgi_2(''') = —2P0{aia'j) where f = r/r and 
angle brackets around indices denote the symmetric, trace-free part. Bessel functions and modified Bessel functions 
are denoted by Jn{x) and /„(a;) respectively. Note that r • (Vr(x)'r(x')) has no component perpendicular to r so the 



correlation is fully described by xl*")- Expanding Eq. (CI) gives the leading terms 

Xir) = - / f^e-^'^'^^y^lr (j,(/.) + [Mir) - ^Ir)] 

Transforming the correlation function we then have 



See Fig. ^ for numerical comparison with the lensed power spectrum. 
For the polarization we have^° 



and 



n— — oo 



diner +cr)^M^ 



2ti 



e-'^-'W/^Zr [j,{lr) + ^^^^ [Ulr) - J,{lr)] + 



(C2) 



= -27r / rdr'^xir). (C3) 



P(l) = E{\) + iB{\) ^- J ^P(x)e-2''^'e-''", (C4) 

P*(l) = £;(!)- ^B(1) = - / ^P*(x)e2"^'e-'' ", (C5) 

J 27r 



^ :P{h) = -;^e2'('^'3-*'2)li. VP(li+l2), (C6) 



<5V(li)* 27r 

P*(l2) = -^e~^''^^'^-'^'2\ ■VP*{li + h), (C7) 



(5?/'(li)* ' ' 2tt 

where I3 = — li — I2. 

We define <f> = <f>\ — <f>y- and ^ ri zL ie^iTj, so that 

• 1 = rle"^"^ i^^^hr^ = T rle^^'^ , (C8) 

and introduce the correlation functions 

X+{r) = r- (VP*(x)P(x')) =r- (VP*(x + a)P(x' + a')) 

/A] l2fr'EE , r<BB\ ;„ °° 

y ^ ' ' ) e-'^-^(-)/^| E [J2„-i(M-J2„-M(M]/„[;^C,K2(r)/2] 



(C9) 



^0 Note that P*(l) is no* the complex conjugate [P(l)]* of P(l); rather P*(l) = [P(— 1)]* so that E{\) and B{1) are the Fourier transforms 
of real fields with e.g. [E(l)]* = E{-1). 
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where 



= -27r 



rdr 



Ji{lr)x+{r). 



^EVE _ qBVB 



rdr 



C, 



^TVE 



{xZ{r)J3ilr) ' xt{r)Mlr)} , 
27T^^ {x-{r)J3ilr) + xt{r)Mlr)} , 
-27r / !±{^+ir).Mlr)-x^{r)Mlr)}, 
-27r/ !±{^+ir),Mlr)+x;,{r)Mlr)}. 



(C18) 
(C19) 
(C20) 
(C21) 
(C22) 



In the absence of lensing Cf "^-^ ^ Cf ^, C^^^ ^ Cf^ and ^ Cf ^, but ^ and ^ 0. Results 

required for the bispectrum then fohow. For example, using 



B{h) 



1 

2^ 
1 

2^ 



S{h+h + h) 

,5(li+l2 + l3) 



li • hCfJ^ sin 2(01, - 0,3) + -e.^lUlCr/^ cos 2(01, ~ 013) 
li • l2Qf sin2(0i, - 0,3) + ^e,dil\CP/^ cos2(0i, - ^13) 



(C23) 
(C24) 



we have^^ 



{T{h)B{h)E{h)) 



Tip 



li-(Qr"l2 + Qr"l3)sin2(0r 



(C25) 



The leading-order result sets the Cf^-^ terms to zero and replaces Cf'^^ and CP^^ with Cf'^ and C^^ respectively. 
The contribution vanishes if there are no unlensed B modes. The approximation made in the text to capture 

the main, non-perturbativc corrections to the leading-order result is to replace Cp^^ with Cp^ . This neglects the 



CP^^ and Cp^^ terms in Eq. (C25). These are of similar magnitude to the lensed i?-mode spectrum, CP^ , and 
much smaller than the change in Cp^ due to lensing which therefore dominates the corrections to the leading-order 
bispectrum. 



1. Full sky 

In the semi-squeezed limit of most interest we can accurately approximate the bispectrum of the lensed fields as 



B 



ijk 



E 

77117712 7713 



h h h 
mi TO2 TO3 



a, 



lim2^'l2rn2"'l3Tn3 



0,1. 



771177727773 



h h h 

mi 771 2 7713 



(C26) 



We shall pursue a non-pcrturbative curved-sky analysis in the approximation in which sky-curvature effects on the 
arcminute-scale of the deflection angles can be neglected, similar to the approximations used when calculating the 
full-sky lensed power spectra [ pT| . Acting on a lensed field of any spin we write 



1 / 5{ia) 



^ ^ 5{-ia) 

/^ri7ni " ^ Him, 



5 



Vhih + 1) 



hni2 



(C27) 



Note that the flat-sky bispectrum of an odd-parity product of fields, Hkc T, B and E, depends not only on the lengths li, I2 and /a but 
also which of the two parity-related configurations of the three vectors is being considered. The bispectra for the two configurations 
have opposite signs. 
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where we used spin ±1 components of the deflection field ol = ^ijj expanded in terms of spin ±1 spherical harmonics 

VW+^ -lYlmi^lrn- (C28) 

Im Im 

The combination — i(iQ!9+-iQ!S) is the spin- weight analogue of a • V acting on a rank |s| tensor field ||44| . 
For a lensed spin S2 field = -sjX* we then have 



Sip* 



Zimi 



where 



(C29) 



I drift { (n) [ g (n + a) - iy,*,„^ (n) [ 3 (n + a) } Y*,^^ (n) 



E 

I'm' 



^h{h + 1) /(2;i + 1)(2Z2 + 1)(2/' + 1) Ml /2 V 



An 



mi 7712 'Tl 



h h I' 

-1 S2 1-S2, 



?i /2 

, 1 S2 -S2 - 1, 



[Ss2X](n + a) = ^^.2X+„,,,2-iY/'m'(n)K;,^2 = ^ ^a-i^^i'm' (n + q:)s2-''^ 



(C30) 



(C31) 



and 



^4- — 



= + s{s + l) = + S{S-1). 



For the polarization we can then write 
6 



llTIll 



{2Phm2 ± -2-P/2m2) — E 



where 



1 /(2Zi + l)(2;2 + l)(2Z' + l) / h I' 



47r 7712 ™' 

3+* -I- /" 1^p^ D^* I £)+* 



+2M, 



h I' 
-I 2 -1 



-[?iai + l)+/'(/' + l)-;2a2 + l)] 



/l /2 /' 

2-2, 



;'2 



(C32) 



(C33) 



2Mi,z- 



v/^iCi + i) 



?2 I' 

-1 2 -1 



^('2 + 



h h I' 

1 2 -3, 



i'2 



(C34) 



and the parity is determined hy p = li + I2 + Iz- 
The required power spectra are of the general form 



CI 



vw 
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2TT I AcosMss'Wm 



(C35) 
(C36) 
(C37) 
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where the last Une follows easily from rotation invariance if n is chosen to lie along the z-axis and n' is in the x-z 
plane at an angle (3 to the z-axis (using (i^^(O) = Sgm and the relationship between spin- weight spherical harmonics 
and Wigner functions). The correlation functions can be calculated following the method in Rcfs. ||l^, |45) where we 
make the same choice of n and n': 

= ^Cr^(.l^™(n + a),,yz,„(n' + a')) 

Im 

hnm' 

- E ^Cr'e-'('+^"^^/^ E + l)Cgi.2/2]dU,-„+.'. (C38) 

/ n— — l 



Appendix D: Simulating the correlated unlensed fields 

Given a method for generating independent Gaussian normal variates with unit variance gr, gE and g^, the unlensed 
full-sky temperature, polarization and Icnsing potential multipolcs at a given / and m can be generated with the correct 
correlations using the Cholesky decomposition of the covariancc matrix between the three fields. Explicitly, this is 
given by 



Elm ~ 
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EE 
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(Dl) 
(D2) 

(D3) 



The lensed field can then be obtained by re-mapping points by the lensing deflection angle; this is implemented in 
the pubhc LensPix^^ code [|6, 47 



As written above, the temperature realization is generated first and correlations to the other fields are generated 
by using gx when they are simulated. Permuting fields, e.g. T -^r^ ip, similar expressions can be obtained where other 
fields are generated first. 



Simulation for lensing bispectrum estimation 



For simulation of a bispectrum it can be useful to reduce the variance by using an estimator that subtracts off most 
of the random realization-dependent scatter in the bispectrum estimator, while leaving the estimator unbiased 
e.g. 



ijk 



E 



h h h 
mi m2 ma 



ai' 



0'= 



(D4) 



where 



— 7Tll 



7712 



and 



0. Note that, here, the overbar denotes counterterms in the 



bispectrum estimator rather than the different usage in Eq. (^3|). When simulating the lensing bispectrum the 
simplest method would be to take a;,„ to be the unlensed field. However since the lensed and unlensed multipoles 
decorrelate at small scales (lenses shift the field around, mixing up the l,m), this does not help very much on small 
scales. It is therefore preferable to use a lensed realization for dim where the fields are as close as possible to the aim 



^ ittp : //cosmologist . inf o/lenspix/ 
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realization but constructed to have zero bispectrum in the mean. For a given random seed, one way to achieve this is 



to construct t/jim from Eq. (D3) with the correlation terms omitted, i.e. set gr = 9e = 0. This does succeed in reducing 
the variance, but it is still significant since the large-scale lenses arc modified by removing the correlations and this 
alters the small-scale lensed aim- If we are only interested in simulating Bi-^i^i^ for a particular range of {/i}, we can 
instead generate counter-term realizations as above but with ipim only modified for I € {h}, and Tim,Eim modified 
for I ^ {li} This significantly reduces the change in the -0 field, and hence the difference between the small-scale aim 
and aim reducing the variance even further. In the main text, when simulating Bi-^i^i^ for li = 4, we simulate the 
I = 4 unlensed fields using a 'T-first' algorithm as described above, but generate all other multipoles using a '^/j-first' 
algorithm, with aim generated using the same random seed but terms correlating ip to T and E set to zero. This trick 
works better for higher li (our li = 50 results) where the correlations are small, but gains less for /i = 4 where the 
correlations are significant so the zero- bispectrum field is still significantly different. 

Appendix E: Tensor vectorization and forms of the bispectrum covariance 

Here we generalize some results for matrix vectorization to tensors, aiming to derive the general result that gives 
the equivalence of the various forms of the bispectrum covariance. For further details and references for matrix results 
see e.g. Refs. ||. 

The elements of a general matrix A can be assigned column-wise into a vector vec{A). We can extend this to a 
3-tensor B so that 

vec(B) = (-Bill, -0211, ^311 ■ ■ • -B„ii, -B121, -B221, -B321 . . . (El) 
and a 3-tensor contraction can be written as 

B^JkD^Jk = vec(B)^vec(D). (E2) 
The Kronecker product of an m x n matrix A with a p x q matrix B is defined to be the mp x nq matrix 



A<S)B = 



f AuB A^B ... Ai„b\ 

A21B A22B ... A2nB 



(E3) 



We can write a matrix-tensor contraction e.g. as ^{F,G,H), so [B(F,G, H)]ijk = FipGjqHkrBpgr- Wc can then 
write 

vec(B(F, G, H))^{H®G® F)vec(B). (E4) 

Using this we have 

B:%,(C-X'^'')i:iC^XBfZi. - vec(B,^,, J^(C,;i ® Cf^' ® Cr^')voc{B,^i^jJ. (E5) 

This result extends straightforwardly to higher-rank tensors (e.g. for higher n-point functions), following the gener- 
alization below, where we also consider the case where the tensor may be symmetric on one or more indices as when 
h, I2 and are not distinct. 

Results for fc-tensor and symmetric tensor vectorization 

The vector vec{A) is the elements of a general matrix A assigned column-wise; wc can extend this to a fc-tensor B 
so that 

vec(B) = (-B11...1, _B2i...i, -B31...1 • • ■ Bni...i, B12...1, B22...1, B32...1 ■ ■ ■ B 

nn.. .n Y- (E6) 

Full contraction of fc-tcnsors can then be written as 

Ba^a^...a>,Da^a^...a^ = Vec(B)'^Vec(D). (E7) 
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We can write a matrix-tensor contraction e.g. as B(iVf^, M"^, . . . , M^), so 

B{M\M\ . . . , M^)],,,,...,, Ml.Ml,^ . . . M^.^B,,,,,,,,, (E8) 

and 

vec(B(M\M2, . . . ,M'=)) = {M'' ® ■ ■ ■ ® ® Mi)vec(B). (E9) 

For a fully-symmetric n-dimensional symmetric rank-fc tensor there are only Snk = ^kiln-iy distinct elements: for 
fc = 3 there are n{n -I- l)(r7, + 2)/6. A tensor which has s of k symmetric indices has Snks = TT^^r^T)!'"''^"^ distinct 
elements; we define vecp(B) to be the corresponding vector of distinct components of B. In general vecp(B) is the 
vector of distinct components of the symmetrized tensor, for example if B is symmetric in its last s indices, 

veCp(B) = i?i...i(i...i2), ^1... 1(1.. .22)1 Si. .12. ..22, ■•■ Sl...l(l...nn): S„...„„...„„^ . (ElO) 

In terms of components, for a fully-symmetric tensor [vecp(B)]i = i3(aia2...afc) where aj < a^+i and 

Q2(a2 - 1) , (as - I)a3(a3 + 1) (afc + fc-2)! 

' = + 2 + 6 + •••+ fc!(a.-2)! " ^^^^^ 

It is sometimes useful to write the vector index in the form [vecp(B)]aja2...aj.. In general a lumped index can be used 
with additional non-symmetric indices. 

The rank n'^ x Snks matrix Bnks is defined so that for a tensor A 

vecp(A) = Bj,,vec(A). (E12) 

For example, forA: = sa2x2x2 tensor A has 

BlfesVCc(A) = (Ahi, (^211 + A121 +^ii2)/3, (^221+^212 + ^122)73, A222) ■ (E13) 

Explicitly 

[BnksUa....a.A.^,...AM.M + ^...B, = ' ' ' ^'a} ^tXl ' ' ' [for h < fe^+l] (E14) 

(we are putting the non-symmetric indices last for convenience). The pseudo-inverse B^f,^ = [B'^f.^Bnks)''^ Bl^f.^ can 
be used to construct vcc(D) from vecp(D) when D is symmetric: 



vec(D) = (B+ J^vecp(D), (E15) 
matrix for symmetrizing vectorized tensors, with components 



and in general gives the vectorization of the symmetrized tensor. The symmetric matrix BnksB^f.^ is the n'^ x n 



{BnksB:^^^)a^a2...a,A, + i...AkMb2-bsB, + i...Bk - ^ll^^ll ■ ■ ■ ^IiI^aIH ■■■^aI- (^16) 

Since 

(M'^ • • • ® M2 ® M^)a,a,...aM...b, = M^Ml,^ . . . Ml,^, (E17) 

we have 

BnksB+t^^{M^ M'+^ ®M®---®M)^ {M^ M'+^ ® M • • • ® M)B„fe,B+j^^. (E18) 
Since {A B)^^ = A^^ (E) B^ (for non-singular matrices) it follows that 

[B^ksiMk ® • • • ® Ms+i (E) M (g) ■ ■ ■ (E) M)BnksV^ = B+^^JM'^ ® • • • ® M7+\ ® Af^ E) ■ ■ ■ E) M-i)(B+^J^(E19) 
Hence for symmetric D and B symmetric on s indices we have 

vec(D)^vec(B(Cr\ Cr\ . . . , C,,+,, . . . , C,;i)) 

= vecp(D)^[B,^;,,(C/, • • • (g) Ci^^, (g C (g) • • • (g C)B„fcs]^^vecp(B). (E20) 
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Bispectra 

A bispectrum estimator using a vector of different fields a;,„ for a specific set of {/} is 

\ijk _ / ^1 h h 



771177127713 

or equivalently 



Bl^ls= E ]<n..<rnA^. (E21) 

2 J L / 1 j-^j^ 1 



■■^^ — ' \ mi mo m-i j 

771177727773 \ "^Z 

If we restrict < ^2 < ^3 then vec(B;^;^;^) is uncorrelated to other vectors with different {/}-labels. 

Using [A ® B)^ — (g) and the general result (for appropriately sized matrices) that {A ^ B){C D) — 
(AC) ® (BD) gives 

{ai.,m.., ® a;„„, ® ai,rm){a-i^m'^ ® a,,„/ ® a,^„/ )^ = (ajg^gaj^^^) ® (a,„„,a;t^^, ) ® (a,,„iia|'^„^, ). (E23) 

Hence using (ajmal,^^^,) = Swdmm'Ci, where Ci here is the total covariance matrix including the effects of instrument 
noise, gives 

(vec(B,^,^,3)vec(B,^;^,3)t) = At,i,i,Ci, ® ® Ci„ (E24) 

where Aj^i^jg = 6(5|J^(5;M|^^ (no implicit sums over Z-labels: Ai-^i^i^ is 6 if = ^2 = '3, 2 if two of the indices are 
equal, and 1 otherwise). Note that this matrix is not invertible if any of the I indices are the same since the tensor 
is then (partially) symmetric, B^J^, = Bf^^, , so vec{'Bi_^i^i^) is a redundant set (some items are perfectly correlated, so 

the covariance is singular). For duplicate indices we therefore use vecp(B;^;^;3) instead, so get a vector of only the 
distinct components, and define 

Covi,z,Z3 = (vecp(B,^;^i3)vecp(4^i^i3)t) = Ai,i,i,B^^,,^Ci, ® Ci, ® Ci,B„ks, (E25) 

and it is understood that vecp = vec (and Bnks = I) if hi h, h are all distinct. 

Following Ref. Q we can find a set of weights to get the estimator S = J2ii<i2<h '^lihh^^'^^^—hi-^h') ^ which has 
Gaussian variance 

{S')^ J2 ^LlCo^rl^hh^hhh■ (E26) 



h<h<h 



Minimizing subject to unit response to B gives Wjjijjg = (S'^)Cov;^;^^;3vecp(B;^;2j3). Using Eq. (E20) the Fisher 
inverse error variance is then 

E vccp(B,^,^,3)tCov-i^,3vecp(B,^;^;3) = ^/Ti2i3^ecp(B,^,^;3)t[B^,^C,3(g)Cz, ®a,B„fe,]-Vecp(B,^;^,3; 

h<l2<h h<h<h 



= E A-i^,3vcc(B,^,^,3)tc-i«C,;i®C,;^ec(B,^,^,3) 

h<l2<l3 

= ^ E vec(B,^,^jtc-i«q;i®C,;Vec(B,^,^J 
= I E iB:^l.r(C~X(C~XiC-XBfZi.- (E27) 



6 

^1/2^3 



This establishes the correspondence between the (zero-signal) variance in terms of the covariance of the elements, the 
result of Ref. [|49[ a nd that obtained from an Edgeworth-expansion of the non-Gaussian likelihood (note the appendix 
[Sec. 6] of Ref. |f9[ is somewhat misleading: the vector of components of bispectra using T and i?-polarization have 
to include all eight possible terms with distinct h, I2, h)- 
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